QUERY PROCESSING OVER INCOMPLETE AUTONOMOUS WEB DATABASES by Hemal Khatri
نویسندگان
چکیده
Incompleteness due to missing attribute values (aka “null values”) is very common in autonomous web databases, on which user accesses are usually supported through mediators. Traditional query processing techniques that focus on the strict soundness of answer tuples often ignore tuples with critical missing attributes, even if they wind up being relevant to the user query. Ideally, the mediator is expected to retrieve such relevant uncertain answers and gauge their relevance by accessing their likelihood of being relevant answers to the query. The autonomous nature of the databases poses several challenges in realizing this idea. Such challenges include restricted access privileges, limited query patterns and cost sensitivity of database and network resource consumption in web environment. This thesis presents QPIAD – a framework for query processing over incomplete autonomous databases. QPIAD is able to retrieve relevant uncertain answers with high precision, high recall and manageable cost. Data integration over multiple autonomous data sources is an important task performed by a mediator. This thesis describes query rewriting techniques to perform data integration over multiple incomplete autonomous data sources on the web. Results of experimental evaluation on real-life databases demonstrate that our system retrieve relevant answers with high precision and manageable cost.
منابع مشابه
Query Processing over Incomplete Autonomous Databases
Incompleteness due to missing attribute values (aka “null values”) is very common in autonomous web databases, on which user accesses are usually supported through mediators. Traditional query processing techniques that focus on the strict soundness of answer tuples often ignore tuples with critical missing attributes, even if they wind up being relevant to a user query. Ideally we would like t...
متن کاملHandling Imprecision & Incompleteness in Autonomous Databases
As more and more information from autonomous web databases becomes available to lay users, query processing over these databases must adapt to deal with the imprecise nature of user queries as well as incompleteness due to missing attribute values (aka “null values”) in the database. In such scenarios, the query processor begins to acquire the role of a recommender system. Specifically, in addi...
متن کاملQUIC: Handling Query Imprecision & Data Incompleteness in Autonomous Databases
As more and more information from autonomous databases becomes available to lay users, query processing over these databases must adapt to deal with the imprecise nature of user queries as well as incompleteness in the data due to missing attribute values (aka “null values”). In such scenarios, the query processor begins to acquire the role of a recommender system. Specifically, in addition to ...
متن کاملQUIC: A System for Handling Imprecision & Incompleteness in Autonomous Databases (Demo)
As more and more information from autonomous databases becomes available to lay users, query processing over these databases must adapt to deal with the imprecise nature of user queries as well as incompleteness in the data due to missing attribute values (aka “null values”). In such scenarios, the query processor begins to acquire the role of a recommender system. Specifically, in addition to ...
متن کاملProject C CSE 494 / 598 Hemal Khatri
CSE 494/598 Hemal Khatri Overview This goal of this project is to mine patterns in the search results by clustering the search results. The different methods used for clustering are: 1. K-Means 2. Buckshot 3. Bisecting K-Means. For each of these methods, we show the algorithm used for computing the clusters as well as the time complexity for each of these algorithms. We also compare the quality...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006